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ABSTRACT 

The DEEP2 project will obtain redshifts for ^60,000 galaxies between z ~ 0.7-1.5 in a comoving 
volume of roughly 7 10 6 Mpc 3 h~ 3 for a ACDM universe. The survey will map four separate 2° by 
0.5° strips of the sky. To study the expected clustering within the survey volume, we have constructed 
mock galaxy catalogs from the GIF and Hubble Volume simulations developed by the Virgo consortium. 
We present two- and three-point correlation analyses of these mock galaxy catalogs to test how well we 
will measure these statistics, particularly in the presence of selection biases which will limit the surface 
density of galaxies which we can select for spectroscopy. We find that although the projected angular 
two-point correlation function w(9) is strongly affected, neither the two-point nor three-point correlation 
functions, £(r) and £( r )i are significantly compromised. We will be able to make simple corrections to 
account for the small amount of bias introduced. We quantify the expected redshift distortions due to 
random orbital velocities of galaxies within groups and clusters ("fingers of god") on small scales of ~1 
Mpc hr x using the pairwise velocity dispersion <j\i and galaxy- weighted velocity dispersion o\. We are 
able to measure <j\ to a precision of ~10%. We also estimate the expected large-scale coherent infall of 
galaxies due to supercluster formation ("Kaiser effect"), as determined by the quadrupole-to-monopolc 
ratio ^2/Co of £(r p , n). From this measure we will be able to constrain (3 to within ~0.1 at z=l. 

For the DEEP2 survey we will combine the correlation statistics with galaxy observables such as 
spectral type, morphology, absolute luminosity, and linewidth to enable a measure of the relative biases 
in different galaxy types. Here we use a counts-in-cells analysis to measure as as a function of redshift 
and determine the relative bias between galaxy samples based on absolute luminosity. We expect to 
measure a% to within 10% and detect the evolution of relative bias with redshift at the 4-5cr level, with 
more precise measurements for the brighter galaxies in our survey. 

Subject headings: galaxies: distances and redshifts — large-scale structure of the universe — surveys — 
galaxies: statistics — galaxies: evolution 



1. INTRODUCTION 

The DEEP2 survey (Davis et al. 2000) will commence 
in the spring of 2002, with the anticipated delivery of the 
DEIMOS spectrograph (Cowley et al. 1997) to Keck in 
the fall of 2001. The survey will use approximately 120 
Keck nights over a period of three years to study galaxy 
properties, evolution, and large-scale structure at z ~ 1. 
The DEEP2 survey will obtain spectra for ~60,000 galax- 
ies between z « 0.7-1.5. The current plan is to use an 830 1 
mm -1 grating with 0.75 " slitwidths, resulting in a spectral 
resolution of 86 km s _1 . We thus expect to easily measure 
linewidths to this limit and redshifts to 10% of this limit. 
The survey will target four 2° by 0.5° fields, which cor- 
responds to a total comoving area of 80 by 20 hr 2 Mpc 2 
at z ~ 1 for a ACDM universe. The line-of-sight comov- 
ing distance for z ss 0.7-1.5 is ^1400 h^ 1 Mpc, though 
in our magnitude-limited survey we will not sample the 
higher-redshift region very densely. The four fields were 
chosen as low-extinction regions which are continuously 
observable from Hawaii over a six month interval. One 
field is the Groth Survey strip, and two fields are on the 
equatorial strip which will be deeply imaged by the Sloan 
Digital Sky Survey (SDSS). Each of these fields are tar- 
gets of a CFHT survey conducted by Kaiser and Luppino, 



with the primary goal of deep imaging in B, R, and I bands 
for weak lensing studies (Wilson et al. 2000). We will use 
their photometric information to color-select galaxies with 
z > 0.7 and mi(AB) < 23.5 for the DEEP2 survey. This 
color-selection allows us to focus on the high-z universe. 

One of the key science goals of the DEEP2 survey is to 
measure the two-point and higher-order correlation func- 
tions of galaxies at z ~ 1 as a function of other observables. 
In almost all models of structure formation (e.g., White, 
Davis, Efstathiou, & Frenk (1987)), galaxies are born as 
highly biased tracers of the mass distribution, but their 
bias diminishes with time. Spiral galaxies today appear to 
be weakly biased, if at all, while the clustering of z ~ 3 
Lyman-break galaxies requires a large bias for any reason- 
able cosmological model (Giavalisco et al. 1998; Wechsler 
et al. 1998). Galaxies at z ~ 1 should have an intermediate 
degree of bias, with readily observable consequences. 

The statistical clustering of galaxies at any epoch de- 
pends on both the underlying dark matter distribution 
and the biasing function; the two are degenerate without 
more information. However, models suggest that patterns 
exist in the relative biases between various types of galax- 
ies, and measuring these patterns as a function of absolute 
luminosity, linewidth, or spectral type can break that de- 
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generacy. With sufficiently dense sampling, determining 
the clustering properties of galaxies can yield direct mea- 
surements of their biasing. For example, in linear biasing 
models [where (6p/p) ga i is assumed to be some constant 
b x (5p/p) ma t\, the mean ratio of the skewness of the den- 
sity distribution function to the square of its variance has 
an expectation value that scales as 1/6 (Fry 1996; Scoc- 
cimarro et al. 1998). We will be able to subdivide the 
DEEP2 sample as a function of internal linewidth and es- 
timate b for each type of galaxy. The sampling density 
of the survey is such that we will be able to obtain spec- 
tra for three-quarters of all galaxies down to L* + 0.5 at 
the redshifts of interest. Comparison of such measures at 
z <~ 1 to their values at z — will then be possible. 

DEIMOS has a field of view of 16' by 5' for both imaging 
and multislit spectroscopy and will use an array of eight 2k 
by 4k Lincoln Laboratory 30 pm thick CCDs, with an ac- 
tive 2-D flexure compensation system to minimize fringing 
at red wavelengths. As the DEEP2 survey will target four 
fields of 120' by 30', slitmasks will be made from a region 
of size 16' by 4' with two rows of 60 masks each. There will 
be ~130 slitlets per mask, with most of the slitlets aligned 
along the long axis, but with some tilted as much as 30° 
to track extended galaxies in order to measure rotation 
curves. The mean surface density of candidate galaxies ex- 
ceeds the number of objects we can select, and spectra of 
selected targets cannot be allowed to overlap on the CCD. 
Adjacent slitmasks will overlap by 2 ' so that each galaxy 
will have two chances to be selected for a mask. Spectra 
will be obtained for ^70% of the targeted galaxies in the 
fields meeting our color and magnitude selection criteria, 
chosen by a slitmask algorithm which is necessarily biased 
against the highest-density regions, where the spectra of 
nearby galaxies would overlap on the CCD. Here we test 
how this bias will affect our measurement of the underlying 
two-point and three-point correlation functions. 

This paper uses a family of mock catalogs to model 
large-scale structure studies that will be possible with the 
DEEP2 survey data. Section 2 describes the construction 
of the mock catalogs while section 3.1 provides results of 
analysis of two-point correlation studies in £(r p ,7r). The 
high resolution spectroscopy of DEEP2 will allow redshift 
space distortions to be readily measured, as discussed in 
section 3.2. This in turn will allow for a measurement of 
the small-scale thermal velocities &yi and <j\ , described in 
section 3.3. In section 3.4 we show that we can measure the 
three-point correlation function £(r) with minimal bias. 
Section 3.5 discusses the ability to measure the evolution 
of clustering within the volume of the DEEP 2 survey. We 
expect to have the sensitivity to detect evolving bias and 
evolving as- Sections 4 and 5 present general discussion 
and conclusions. 

2. CONSTRUCTION OF MOCK CATALOGS 

We constructed mock catalogs from the Hubble Vol- 
ume and GIF simulations made by the Virgo Consor- 
tium (www.MPA-Garching.MPG.DE/Virgo/). To match 
the DEEP2 survey, our mock catalogs cover 2° by 0.5° 
on the sky, with a redshift range of z=0.7-1.5. For both 
simulations we use the ACDM models with J7 maMer =0.3, 
fi A =0.7, h=0.7, and er 8 =0.9. The Hubble Volume Project 
produced deep-wedge geometry lightcone outputs from 



simulations of 10 9 particles which include smooth clus- 
tering evolution as a function of redshift (Evrard et al. 
2000). The lightcone output is designed to match what 
an observer would see and uses propagating output filters 
at the speed of light through the simulation in order to 
produce images which have evolution in the redshift direc- 
tion. The ACDM simulation covered a comoving cube of 
length 3000 hr 1 Mpc, and we constructed 16 independent 
catalogs from subsets of the lightcone output. The Hub- 
ble Volume simulations have a mass resolution of 2.25 10 12 
hr 1 M so iar per particle and a softening scale of 100 kpc 
h . We only use particles in the simulation which are 
labeled "galaxies" in the L2 bias model, determined from 
the initial overdensity field at that location as described 
in Yoshida et al. (2000). This bias model has both a lower 
and upper threshold cutoff in the overdensity field inside 
of which "galaxies" will form. This results in an enhance- 
ment of voids as well as creates a small anti-bias needed in 
ACDM models to match observed galaxy correlations on 
small scales (Jenkins et al. 1998). 

We also use simulations from the GIF project, which 
combine much higher-resolution N-body simulations with 
semi-analytic models to study formation and evolution of 
galaxies Kauffmann et al. (1999). The particle mass of 
the GIF simulations is 10 10 M so i ar . These simulations are 
in the form of "snapshots" of comoving cubes at various 
redshift intervals. The cubes are 141 /i _1 Mpc on a side, 
and we use outputs at z=0.62, 0.82, and 1.05. In order 
to produce catalogs which continuously cover z=0.7-1.5, 
we had to stack two z=0.62 cubes, three z=0.82 cubes, 
and five z=1.05 cubes in the redshift direction. Since the 
cubes can be randomly oriented, and 141 h^ 1 Mpc corre- 
sponds to ~ 5° at z=0.6 and 3.5° at z=1.0, we were able 
to select 2° by 0.5° subsets of different orientations from 
each cube when stacking them, thereby reducing structure 
replication in the redshift direction in our mock catalogs. 
We constructed 6 separate mock catalogs from the GIF 
simulations, roughly equal to the number of independent 
catalogs which could be made from the GIF galaxy sim- 
ulation output. There is, however, some replication at 
z > 1.0, where we had to stack more cubes from the same 
snapshot output. 

To assign each galaxy a redshift, we assumed a ACDM 
cosmology with the following relation between comoving 
distance and redshift: 

c f dz 

r = / -^^^=^^^= (1) 

Ho J y/(i + z ) 3 n m + n A 

where r is the comoving distance in units of h^ 1 Mpc and 
f2 m =0.3 and fiA=0.7. To analyze both the Hubble Vol- 
ume and GIF simulation mock catalogs in redshift space 
we added the peculiar velocity along the line of sight, v p 
(km s -1 ), to the zero-peculiar-velocity-redshift, z : 

z = z + ^(l + z ). (2) 

c 

In the analysis of the redshift space catalogs, the relations 
between z and r are inverted, and we convert the new z 
to a given r. Note that one must assume an underlying 
cosmology to analyze high-redshift catalogs, and if the as- 
sumed model is incorrect the resulting statistics will be 
distorted. 

As the simulations provide a volume-limited sample, we 
apply a selection function to our mock catalogs to mimic 



the a priori unknown selection function of the magnitude- 
limited DEEP2 survey. For the Hubble Volume simula- 
tions we follow the prescription of Postman et al. (1998) 
for N(z) models and integrate a Schechtcr luminosity func- 
tion in redshift bins. We conservatively use a model with 
no luminosity evolution as a pessimistic case from which to 
estimate the correlation amplitudes and errors, as the no- 
evolution model has fewer galaxies at high-z than models 
with evolution. After applying this selection function, we 
then randomly dilute the density of galaxies uniformly at 
all redshifts to match the observed surface density of high- 
z objects in our CFHT photometry fields of ~ 5 galaxies 
arcmin~ 2 . We expect to successfully measure redshifts 
for approximately 80% of our observed sample of galaxies 
in the survey, and so we further dilute the mock catalog 
by an additional 20%. The resulting mock catalogs, each 
corresponding to one of our four fields, contain ^18,000 
galaxies. In Figure 1 we plot one of our mock Hubble 
Volume catalogs in both real and redshift space, collapsed 
along the smallest axis. The no-evolution selection func- 
tion applied to the mock catalogs is shown in figure 2a, 
along with the redshift distribution of galaxies in one of 
the catalogs in figure 26. 

The GIF simulations contain absolute magnitudes in B, 
V, R, I and K for each galaxy, which we use to create a 
flux-limited subsample from our mock catalogs. We make 
an apparent S-band cut of 23.4 mag, which at z ~1 cor- 
responds to an apparent /-band mag with the appropriate 
K-correction, resulting in flux-limited catalogs of ^19,000 
galaxies. An image of one of our GIF mock catalogs is 
shown in figure 3 in redshift space. The redshift distribu- 
tion of these catalogs agrees quite well with that of the 
Hubble Volume mock catalogs. Figure 3 also shows the 
results of our target selection on one of the GIF mock 
galaxy catalogs. The middle panel shows galaxies which 
would be targeted to be on a slitmask, while the lower 
panel shows galaxies for which we would not be able to 
obtain spectra. The map of galaxies missed by our selec- 
tion procedure traces the same structure as the map of our 
targeted galaxies. 

3. RESULTS 

The GIF simulations match our current CFHT photom- 
etry quite well in terms of the projected angular correla- 
tion w(9), notably at small separations of <20 arcsec. This 
holds for all the galaxies as well as subsamples selected by 
color. We therefore use the GIF mock catalogs to esti- 
mate how well we will measure the two- and three-point 
correlation functions and their redshift distortions, and to 
test the effects of our target selection algorithm, which is 
most strongly biased on small projected scales on the sky. 
The Hubble Volume simulations do not have the mass res- 
olution to match the projected correlation amplitudes on 
small scales but do have the advantage of continuous clus- 
tering evolution with redshift. We use both the Hubble 
Volume and GIF simulations to estimate how well we will 
measure o% as a function of redshift in the DEEP2 survey. 

3.1. The Two-point Correlation Function and Bias of 
our Slitmask Algorithm 

The two-point correlation function is defined as the ex- 
cess probability above random that a pair of galaxies ex- 



ists with a given separation (see Peebles 1980 for details). 
This statistic is a simple way to characterize the amount 
of clustering seen in the galaxy distribution. We present in 
figure 4 the average two-point correlation analysis £(r p , w) 
of the six GIF mock galaxy catalogs, as a function of sep- 
aration across the line of sight (r p ) and along the line of 
sight (71"). The thick solid contour traces where the cor- 
relation amplitude is equal to 1.0, and the scale length of 
the clustering is roughly 6.0 h^ 1 Mpc in real space as seen 
on the Tp-axis. The dotted contours are Ict standard de- 
viations derived from the six independent catalogs. These 
contours are quite confined, confirming that the DEEP2 
design will lead to strong constraints on the clustering of 
distant galaxies. There is, however, significant covariance 
among the residuals. The covariance matrix is comprised 
of correlation coefficients r,-,- 
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where a%j is the covariance between points i and j and o~i 
is the square root of the variance of point i. The values 
of the covariance matrix are high, above 0.75 for scales of 
5-20 h~ x Mpc. This is not surprising, suggesting that 
poorly sampled large-scale modes dominate the errors. Er- 
ror analysis in the real data will need to account for this 
covariance. 

A critical function of the GIF mock catalogs is to test the 
effects of the target-selection algorithm on the measured 
two-point correlation function. The results are shown in 
Figure 5. Here we plot the mean two-point correlation 
amplitude £(r p ,7r) for all the galaxies in our catalogs in 
solid contours as well as for only those galaxies selected 
to be on slitmasks in dotted contours. The correlation 
amplitude for targeted galaxies alone is lower than for all 
the galaxies, as expected. We also show in figure 5 the 
result of a simple correction for this bias. For galaxies not 
targeted to be on slitmasks, we substitute the redshift of 
the nearest neighbor on the sky which was targeted and 
is within the expected photometric redshift error of z=0.1 
of the un-targeted galaxy and use its redshift as that of 
the un-targeted galaxy. In this way we use the angular in- 
formation and the photometric redshift of the un-targeted 
galaxies and can partially recover the amplitude errors in 
the correlation analysis. 

This correction works extremely well on large scales, >5 
h^ 1 Mpc. The correction underestimates the "finger of 
god" effect on small scales (<5 hr 1 Mpc), where it ef- 
fectively positions the un-targeted galaxies at the iden- 
tical physical distances of the nearest neighbors on the 
sky, thereby reducing the effect of random thermal mo- 
tion. The conclusion is that the target-selection bias can 
be corrected in a simple manner on scales >5 h" 1 Mpc. 
The most straightforward means of dealing with the re- 
sulting bias on small scales will probably be to filter all 
models by the same selection procedure, to estimate the 
degree to which the targeted galaxies are underestimating 
the correlation amplitude. 

3.2. Redshift Distortions in the Measured Two-point 
Correlation Function 

The mock catalogs can be used to quantify the preci- 
sion with which wc can measure redshift distortions in the 



two-point correlation function £(r p , it) in the DEEP2 sur- 
vey. Redshift distortions are expected on small scales due 
to the random internal velocities of galaxies in groups and 
clusters, creating so-called "fingers of god" on small scales 
(< 5 h^ 1 Mpc). This effect can be seen in the redshift 
two-point correlation function £(r p ,7r) shown in figure 4 
as the elongation along the y-axis (separation along the 
line of sight) at small scales. On larger scales a flatten- 
ing of the two-point correlation function contours can be 
seen, which is due to coherent infall of galaxies resulting 
from the gravitational pull of large forming superclusters. 
A standard method by which to quantify these redshift- 
space distortions is to measure ^2/^0: the quadrupole to 
monopolc moments of the two-point correlation function 
(Hamilton 1998). The multipolc moments of the two-point 
correlation function are defined as 

&(*) = (2J + l)/2 U(r p ,n)P l (cose)dcose. (4) 

^2/^0 is greater than zero on small scales where the "fin- 
gers of god" are apparent and less than zero on large scales 
where coherent infall appears. Figure 6 plots ^2/^0 as a 
function of redshift separation, with la errors as derived 
from the six separate GIF mock catalogs. Unlike the mea- 
surement of £(r p , 7r), the covariance matrix is large only on 
large scales, greater than 12 /i _1 Mpc. The bias of our tar- 
get selection procedure can be seen as the dotted line, and 
the correction discussed in the last section is shown with a 
dashed line. As with the £(r p , it) statistic, the effect of tar- 
get selection is to slightly decease the amplitude of £2 /£o 
on all scales. The applied correction increases $2/^0 on 
scales >5 h _1 Mpc and underestimates it on small scales. 
We will need to apply a bias correction to account for this 
in the survey. However, the mock catalogs show clearly 
that the correlation anisotropy will be robustly detected 
in the DEEP 2 survey. 

The amount of flattening seen on large scales constrains 

Q0.6 

the parameter (3 = — P-, where b is the bias between the 
galaxy and dark matter clustering. In linear theory, 



&/& = /(n) 
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(5) 
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where f(n) =(3+n)/n and n is the index of the two-point 
correlation function in a power-law form: £ ex r _ ( 3 +™) 
(Hamilton 1992). Our la errors plotted in figure 6 show 
that in one of our slices we should be able to measure (3 
to within +/- 0.2, so that for our full sample of four slices 
our error on (3 should be +/- 0.1. In order to extract £l m 
from our measurement of [3 we will need to know the ab- 
solute bias of our galaxy sample, which we may be able 
to determine from our measurements of the three-point 
correlation function relative to the two-point function (see 
section 3.4). 

3.3. Pairwise and Object-weighted Velocity Dispersion 

The pairwise velocity dispersion is a parameter which 
measures the small-scale thermal motions of galaxies and 
probes the mass density of the universe. It has been mea- 
sured to be around 400-500 km s _1 for various redshift 
surveys at \ow-z (e.g. Fisher et al. 1994; Marzke et al. 
1995; Jing, Mo, & Borner 1998). In this analysis, we con- 
fine our test to a measurement of the pairwise velocity 



dispersion 012 by comparing £(r p ,7r) in real and redshift 
space on the scale of r p =l h~ x Mpc, where the "finger of 
god" effect dominates redshift distortions. (In this section 
we will use a subscript R to indicate real space.) For this 
analysis we use the flux-limited mock catalog before slit 
selection, averaged over all 6 catalogs. 

Following Fisher et al. (1994), we first define 



£(r p = 1,tt) = 0.5[£(r p = 0.5Mpc h' 1 , 7r)+£(r p 



in both real and redshift space, £i?(r p 
l,7r), for values of it < 20 hr 1 Mpc. 



1.5Mpc/i _1 ,7r)] 

(6) 



= 1, 7r) and £(r 
We then normalize 



£(r p = 1, 7r) so that our subsequent fitting will be sensitive 



to the overall shape of £(r p = 1, 
amplitude: 



but insensitive to the 
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(7) 



where ir max is the maximum value of it used in the analysis 
(here 7r ma:r =20 hr 1 Mpc). 

Within the mock catalogs, the pair-weighted dispersion 
of the known peculiar velocities is 500 km s _1 for galaxies 
with radial separations <20 ft. -1 Mpc within a projected 
distance of 1 h^ 1 Mpc, as measured in real space. The 
mean infall velocity, v±2, is 160 km s _1 at the same scale. 
To determine a\2 in the redshift space catalog, we con- 
struct models of £(71") by convolving the measured £,r(tt) 
in real space with different distribution functions of veloc- 
ity differences for pairs of galaxies separated by vector dis- 
tance r to compare with the measured £(7r). Let the vector 
r separating two galaxies be decomposed into a transverse 
component r p and a line-of-sight component y, where y 
can be converted into a velocity separation at the given 
redshift. Then the velocity difference is given as n-y, and 
the convolution becomes 
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where r 2 = 



(8) 



r p + V ■ F° r the mean infall velocity, Vi2(r), 
we use the similiarity solution of Davis & Peebles (1983), 
v 12 ~ r/[l + (r/r ) 2 } with r =5.83 h" 1 Mpc. At z=l in 
a LCDM model this corresponds to 71 km s _1 . Figure 7 
plots the measured £(tt) with models of a\i equal to 350, 
400, and 450 km s _1 . Using a minimum chi-squared test, 
we find that the best fit is CTi2=410 km s _1 with a la error 
of 80 km s _1 . If we instead use a mean infall velocity of 
i>i2(r)=160 km s _1 at r=l h^ 1 Mpc, we find cri2=485 km 
s _1 , consistent with the measured peculiar velocity disper- 
sion in the mock catalog of 492 km/s. Note that redshift 
space distortions are larger at high redshift than at low 
redshift by the factor (1 + z), and that we have removed 
this extra factor in our modeling of a 12 by simply using 

1+2=2. 

The CT12 statistic is known to be unstable, however, as 
it is pair-weighted and therefore quite sensitive to rare, 
rich galaxy clusters (Davis, Miller, & White 1997). Jing & 
Borner (1998) also argue that the measured value of a\i 
depends strongly on the assumed value of vyi, the mean 
infall velocity, which here we have modeled in an adhoc 
manner. We therefore also measure the galaxy-weighted 
velocity dispersion a\ , which is a more stable statistic and 
a measure of the one-dimensional peculiar velocity disper- 
sion of galaxies relative to their neighbors. This statistic 



r p = 1 h 

results for r p =2 hr 1 



is more easily evaluated and does not rely on assumed val- 
ues of the mean infall. Following the prescription of Baker, 
Davis, and Lin (2000) we construct the velocity distribu- 
tion D(Av) by a weighted sum over N g galaxies: 

1 N ' 

D(Av ^ = w^ m [Pi{Av) - Bl{Av)] (9) 

^ i— 1 

where Wi is the weight for each galaxy, determined by the 
number of neighbors in excess of a random background. 
Here Pi(Av) is the distribution of neighboring galaxies 
within a cylinder of projected radius r p and half-length 
Vi in redshift space, while Bi(Av) is the background dis- 
tribution expected for an unclustered galaxy distribution. 
Baker et al. (2000) describe a weighting scheme where 
galaxies for which the distribution is negative can be in- 
corporated into D(Av) by considering the high and low 
density samples separately and combining them, weight- 
ing by the number of objects included in each sample. This 
statistic results in an unbiased, object- weighted measure of 
the thermal energy of the galaxy distribution. The veloc- 
ity distribution D(Av) can be seen in figure 8. To measure 
the intrinsic dispersion, o~\ , we model the real space corre- 
lation function £(r) convolved with an exponential velocity 
broadening function, 

f(v) = -exp(-M) (10) 

CTi CTi 

(see Davis, Miller, & White (1997) and Baker ct al. (2000) 
for details). We measure <ti=180 +/--20 km s _1 for a 
1 Mpc and vi =2500 km s _1 . We find similar 
Mpc. The model fit for eri = 180 km 
s _1 is shown in figure 8. 

This value of o\ should be compared to the 1-D rms 
peculiar motion of 315 km s _1 for the GIF mock galaxies 
in the catalog. This rms peculiar motion includes large 
scale flows, while <j\ measures only the nonlinear thermal 
component of the peculiar velocities. The difference is of 
the magnitude expected. Note that Baker et al. (2000) 
measured o\ = 126 ± 10 km s _1 for the LCRS survey, and 
it is quite likely that DEEP2 will report a value lower than 
this for the distant Universe. This might become a modest 
mismatch between the GIF simulations and reality. 

3.4. The Three-point Correlation Function 

The highly non-Gaussian nature of the galaxy distribu- 
tion can be described with the hierarchy of A-point cor- 
relation functions. In particular, the three-point correla- 
tion function contains a wealth of information on gravita- 
tional collapse, initial conditions of the power spectrum, 
and most importantly, bias and galaxy formation. In this 
paper we present preliminary measurements of the three- 
point correlation function in our DEEP2 GIF mock galaxy 
catalogs. Our goal is to show that the three-point correla- 
tion function can be measured with sufficient accuracy to 
be interesting and that the our selection procedure does 
not systematically bias our results to a degree that would 
hinder proper interpretation. 

The three-point correlation function is defined as the re- 
duced joint probability of finding a triplet of galaxy at a 
particular configuration (Peebles 1980). We have used the 
estimator by Szapudi & Szalay (1998) for extracting the 
three-point function: 

(=(DDD-3DDR + 3DRR- RRR)/RRR. (11) 



In this equation contributions to a triplet from the data 
and a random set are denoted with a D and R respectively. 
The above estimator is expected to be the most accurate 
edge-corrected formula in the family of estimators based 
on triplets. 

For our measurements of the GIF mock catalogs we 
chose eight logarithmic bins centered on 0.613 h^ 1 Mpc 
to 10.779 h" 1 Mpc. All possible triangles with those sides 
(allowed by the triangle inequality) were used, i.e. 76 mea- 
surements were performed for each sample. The fast Ap- 
point correlation code by Moore et al. (2000) based on a 
novel double tree algorithm was used for the calculations. 

We have divided the GIF mock galaxy catalogs into four 
redshift ranges z=0.7-0.9, 0.9-1.1, 1.1-1.3, and 1.3-1.5. 
For each redshift range, we used the six independent cat- 
alogs to enable a calculation of the cosmic (co)variances. 
Finally, for each simulation we calculated the three-point 
correlation function for all the galaxies in the catalog, as 
well as for only those galaxies which would be selected to 
be observed using our target selection slitmask algorithm. 
We used four times as many random points as galaxies for 
our measurements. 

Figure 9 contains the binned results for the shallowest 
slice, z=0.7-0.9. For this preliminary investigation the re- 
sults are interpreted in terms of the simple hierarchical 
assumption (Davis & Peebles 1983; Peebles 1980), 



C = Q 3 (66 + 66 + 66) 



(12) 



The three-point function is plotted in terms of the hierar- 
chical term on the right. If this equation were exact the 
result would be a straight line, as illustrated with a dotted 
line for the case Q3 = 1. Note that Q3 shows a weak scale 
dependence. 

According to figure 9, the hierarchical assumption gives 
a good qualitative description of the data. The tight rela- 
tion illustrates the quality obtainable by the DEEP2 red- 
shift survey. The scatter is due to measurement errors, 
cosmic variance, and an intrinsic shape dependence of the 
three-point correlation function with scale. The increasing 
error bars on large scales is inevitable because of cosmic 
variance. With a cross-scan size of 20 h^ 1 Mpc, triangles 
with baselines >10 hr x Mpc will be poorly contrained by 
the DEEP2 survey. Figure 9 shows that the effect of the 
target selection bias is to decrease ( by ^20%. This, of 
course, will have to be carefully modelled in analyses of 
the real data. 

Quantifying and separating the intrinsic shape depen- 
dence of the £ with scale is the key to determining bias 
from the three-point correlation function. Analysis on 
quasilincar scales (Fry 1994) suggests that C can be used 
for a determination of the galaxy bias that is independent 
from studies of kinematics or redshift-space distortions. In 
this way we will hopefully be able to measure absolute bi- 
ases instead of relative biases. This is left for a subsequent 
paper; here our purpose is mainly to illustrate the fea- 
sibility of a high accuracy three-point measurement with 
the DEEP2 redshift survey, and to show that the incom- 
pleteness caused by the slitmask algorithm does not cause 
significant, uncontrollable systematic errors. 



3.5. Measuring as and Relative Galaxy Bias as a 
Function of Redshift 

We expect to measure the evolution of a% as a function 
of redshift for the galaxies observed in the DEEP2 survey. 
As a simple test of the sensitivity of this measure, we use 
a counts-in-cells analysis to calculate as in both our Hub- 
ble Volume and GIF simulations mock galaxy catalogs in 
redshift bins of Az=0.1. The counts-in-cells method (Pee- 
bles 1980) uses the moments of the counts of numbers of 
galaxies in spheres of a given size to quantify the amount 
of galaxy clustering present. 

By counting objects in randomly-placed spherical cells 
(Ni) of radius I, one can determine the moments of the 
counts: 

Mi =< N > (13) 



M2 =< {N t - < N, >) 2 > 



(14) 



For each mock catalog, we created 40,000 spherical cells 
with radii of 8 Mpc hr x and randomly placed them 
throughout the catalog, ensuring that the centers of the 
cells were at least 8 h^ 1 Mpc away from the boundary of 
the mock catalog. In this way none of the cells overlapped 
the catalog boundaries. We then divided the mock catalog 
into slices of Az=0.1 bins (eg. 0=0.7-0.8) and calculated 
the moments of the galaxy counts and as as a function 
of redshift. These moments are related to the volume- 
integrated two-point correlation function and as as: 



^2=^1+ Mi6 



07 



(15) 
(16) 



We performed this analysis on both the Hubble Volume 
and GIF mock catalogs, as each has an emphasis on differ- 
ent properties which will be relevant in our DEEP 2 survey 
data. The GIF mock catalogs do not have continuous evo- 
lution as they are made from three snapshot outputs at 
different redshifts stacked in z-space. However, we were 
able to use the absolute magnitudes of the galaxies to ap- 
ply a realistic magnitude- limited selection function, and as 
a consequence the galaxies at high- z are intrinsically more 
luminous than those at low-z. Since bias is expected to be 
luminosity-dependent, the inherent bias of the galaxies in 
the GIF catalogs will change with redshift. The Hubble 
Volume catalogs, however, have realistic continuous evolu- 
tion in their light-cone output, but the bias should change 
more slowly with redshift as the galaxy selection is the 
same at all redshifts. 

To take into account the changing selection function 
with redshift, for the GIF mock catalogs we created a 
volume-limited sample within each redshift bin. We used 
the absolute B-band magnitude to keep only those galaxies 
which are bright enough so that they could have been ob- 
served at the far end of the redshift bin. For the Hubble 
Volume mock catalogs we did not have magnitudes and 
so were not able to create volume-limited samples. The 
Hubble Volume simulations therefore have a slight density 
gradient within each redshift bin. We tested the effects 
of this small density gradient on the counts-in-cells results 
from the GIF mock catalogs with and without volume- 
limited samples and found a 1% difference in the values of 
as in the volume-limited sample, and therefore conclude 
that it is a negligible effect. 



The results for the Hubble Volume mock galaxy cata- 
logs are shown in figure 10a. The solid line is the mean 
as as measured in our sixteen independent mock catalogs, 
while la errors as determined from the variance between 
the different catalogs are shown in dashed lines. The ex- 
pected linear evolution of erg in the underlying dark matter 
is shown as a dash-dot line (normalized to 0.9 at z=0, as 
were the simulations; see Fry 1996 for analytic formula). 
If we fit our estimates of as{z) of the galaxies to the ana- 
lytical form 



as{z)=A 



l + z 



(17) 



we find A=0.64 (1<t=0.01) and a=-0.51 (0.10). The co- 
variance between A and a is low; the off-diagonal elements 
of the covariance matrix are equal to -0.36. 

An evolution of the bias between the galaxies and the 
dark matter is apparent in the shallower slope of the galaxy 
clustering evolution compared to the dark matter linear 
evolution prediction. Figure 10b is a plot of the estimated 
bias, equal to <78(galaxies)/cr8(dark matter). The bias is 
slightly larger at high- z, as is expected from hierarchical 
structure formation models. An analytic fit to the slope of 
the bias evolution results in a best fit of A=1.16 (la=0.01) 
and a=0.37 (0.10), with the same low covariance as the 
as analysis. Evolution of bias is therefore expected to be 
present at the 4a level within a single slice of the DEEP2 
survey, but separation of the changing bias from the evolv- 
ing matter distribution will be complex. For a given cos- 
mological model, we can use linear theory to calculate the 
latter quantity. 

The evolution of as for the GIF mock catalogs is shown 
in figure 11. The best fit to the analytic form for the 
evolution of a 8 is A=0.80 (0.01) and a=0.38 (0.13). The 
change of bias with redshift is best fit by A=1.45 (0.01) 
and a=1.26 (0.13). Here we are using mock galaxies of 
all luminosities, so that the bias changes with redshift not 
primarily because of evolution but because at high-z we 
measure only the intrinsically more luminous galaxies in 
our magnitude- limited sample, and these objects are more 
clustered at all redshifts. This is an effect we will see in the 
DEEP2 survey, where we will have to subdivide the galax- 
ies by linewidths or luminosity to separate the effects of 
bias evolution from differing bias in different galaxy pop- 
ulations. 

In figure 12 we show the B-band luminosity function 
(LF) of one GIF mock catalog in different redshift bins. 
At lower redshifts, the LF extends to galaxies with abso- 
lute B mag of -19, but at higher redshifts only the brightest 
galaxies are seen. This change of the LF with redshift is 
primarily due to our apparent magnitude sample selection, 
though there is some inherent evolution in the LF which 
is included in the GIF simulations. We made subsamplcs 
of the galaxy catalogs based on absolute B-band magni- 
tude and show the bias evolution for different magnitude 
ranges in figure 13. There is a clear trend between galaxy 
magnitude and the amplitude as well as evolution of bias. 
The brighter galaxies have a higher bias as well as stronger 
evolution. The values of the amplitude of the bias at z—1 
(A) and the slope of the evolution (a) are given in the 
figure, with la errors in parentheses. We are able to mea- 
sure relative bias between galaxy types to within 0.02 and 



measure evolution of bias at the 5-6tr level for the bright- 
est galaxies (-24<B mag<-21) and at the 3-4cr level for 
the fainter galaxies (- 21<B mag<-19). Note that these 
values for the bias are calculated in rcdshift space. 



parameter space plane (Newman et al. 2001). Simulta- 
neous execution of all these tests with the DEEP2 survey 
data is thus capable of setting rather tight constraints on 
w and fl m . 



4. DISCUSSION 

A key goal of the DEEP2 redshift survey of galaxies is 
to constrain the evolution of large-scale structure in the 
universe. In most cosmological models, the galaxy distri- 
bution is expected to evolve measurably between z = 
and z = 1 , and different types of galaxies are expected to 
evolve independently. The survey will hopefully be suffi- 
ciently robust to allow subdivision of the sample so as to 
separately constrain the evolving bias of distinct classes of 
galaxies. As indicated by the modest size of the error bars 
in the comparison of mock catalogs, clustering constraints 
derived from the DEEP2 survey are expected to have only 
modest cosmic variance. They will thus supersede the cur- 
rent contradictory constraints on the strength of high red- 
shift galaxy clustering derived from the modest surveys 
undertaken to date. As shown in figure 4, each slice of 
the DEEP2 survey should lead to a 10% estimate of the 
correlation length of clustering tq. Subdivision into differ- 
ing samples will not degrade the measurement precision as 
rapidly as n" 1 ' 2 , because cosmic variance is dominant in 
this estimate, not Poisson noise. 

The tests described above demonstrate the power of the 
DEEP2 survey for refined estimates of redshift space dis- 
tortions in the distant Universe. The high resolution of the 
DEIMOS spectra will yield high quality redshifts, thus en- 
abling the study of the kinematics internal to the galaxies 
and the kinematics of small scale clustering at z = 1. It 
should be readily possible to measure all these quantities 
within the survey volume, although the estimated CT12 is 
likely to be too noisy to set useful constraints. We note 
that <7i is measureable to 10% precision, while CT12, after 
averaging over all 6 mock slices, is measurable to only 20% 
precision. The Kaiser infall effect on large scales should 
yield a 10% measurement of /?, when averaging over the 
full survey. Estimates of the infall from subsamples of the 
DEEP2 survey will not degrade as rapidly as the Pois- 
son noise, since this statistic is also dominated by cosmic 
variance. 

Precise constraints on the evolution of galaxy correla- 
tions will not only greatly improve our understanding of 
galaxy formation in hierarchical models, they will also fa- 
cilitate novel cosmological tests of considerable interest. 
For example, the counting of galaxies in the classic dN/dz 
test (Newman & Davis, 2000; 2001) is capable of set- 
ting good constraints on cosmological parameters such as 
w = P/p versus Q m , but serious degeneracy will remain 
even after counting 10 4 galaxies in the range 0.7 < z < 1.5. 
The magnitudc-redshift relation derived from distant su- 
pcrnovae leads to constraints with very similar degeneracy 
between w and Q m . To a considerable degree this degen- 
eracy can be broken by additional, complementary tests. 
For example, the expected ratio of the J3 integral of £(r) 
at z = 1 compared to that at z = has a different depen- 
dency on these same cosmological parameters, leading to 
contours of constant J3 ratio that cross those of the dN/dz 
test. Further tests, such as the abundance of clusters of 
galaxies, lead to quite different constraints in this same 



5. CONCLUSIONS 

The clustering of galaxies at high redshift is intimately 
connected to our incomplete understanding of galaxy for- 
mation, and both disciplines will advance as we analyze the 
details of clustering in the soon-to-begin DEEP2 survey. 
In order to study the precision with which one can expect 
to measure the clustering of galaxies at z ~ 1, we have 
generated a series of mock catalogs of the DEEP2 galaxy 
redshift survey. The mock catalogs are derived from two 
separate VIRGO simulations, one with continuous evolu- 
tion of the underlying structure but inadequate mass res- 
olution, and another with adequate mass resolution but 
which required the stacking of numerous simulation vol- 
umes to achieve a size sufficient to match the redshift range 
of the survey. 

The DEEP2 survey will use multislits, rather than mul- 
tffibcrs, to multiplex spectroscopic targets. The necessity 
of avoiding overlapping spectra in the survey limits the 
sampling efficiency of spectroscopic target selection, es- 
pecially in regions of the sky with higher than average 
surface density of galaxies. The tentative selection algo- 
rithm for the survey will lead to ~ 70% success rate in 
target selection. Although the resulting bias substantially 
affects the angular correlations of the selected targets, the 
correlations observed in redshift space are only modestly 
affected, and the bias can be largely removed by double 
counting galaxies closest on the sky and within the pho- 
tometric redshift error to those galaxies not selected for 
spectroscopy. 

Distortions in redshift space as measured by the £(r p , n) 
diagram will be cleanly detected in the DEEP2 survey, 
and we expect to measure both the "fingers of god" on 
small scales and coherent infall motions of galaxies on large 
scales. Additional tests of velocity field effects, such as the 
<7i test of Davis, Miller, & White (1997), will be readily 
possible with the sample and will be much less affected by 
the bias in the target selection algorithm. 

We expect to be able to measure as to within 10% in 
individual slices of the survey. Assuming a cosmological 
model, we can detect the evolution of relative bias with 
4-5cr significance within the volume of the DEEP2 survey 
itself, with more precise measurements for the brightest 
subsample of galaxies. 

The survey geometry is designed to allow a good mea- 
sure of the three-point clustering amplitude for scales 
r < lOh^ 1 Mpc, and tests of the mock catalogs indicate 
that this will indeed be possible. Such a measure will allow 
an independent test of the bias in the galaxy distribution. 

The DEEP2 survey is designed to be a comprehensive 
study of the Universe at z = 1. Having a sample of high 
redshift galaxies comparable in quality to those available 
locally will enable many independent tests of the evolu- 
tion of structure in the Universe. With numerous cross 
checks and subdivision of the redshift catalogs into dif- 
ferent classes of galaxies, we should be able to untangle 
evolution in the galaxy bias from evolution in the under- 
lying matter distribution. 
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Fig. 1. — One of our mock galaxy catalogs constructed from the Virgo Consortium Hubble Volume simulations, shown in both real and 
redshift space. This lightconc output has the advantage of continuous evolution, but the mass resolution is poor. 





Fig. 2. — a. The redshift selection function which was applied to the Hubble Volume mock catalogs, b. A histogram of the redshift 
distribution of galaxies in one of the Hubble Volume mock catalogs. 
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Fig. 3. — One of our GIF mock galaxy catalogs shown in redshift space. In the top panel are all the galaxies in this mock catalog, while 
only those targeted for slitmasks arc shown in the middle panel. The galaxies for which we would not obtain spectra are in the lower panel. 
This simulation has better mass resolution than the Hubble Volume simulation and has semi-analytic models applied. We used the known 
absolute luminosity of each galaxy to create a flux-limited sample of 7=23.4 mag. 
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Fig. 4. — The mean redshift-spacc two-point correlation function £(r p ,7r) of our six GIF mock galaxy catalogs is shown in solid lines, with 
la errors in dotted lines. The data have been smoothed with a 3-pixel boxcar in each direction. The contour levels are 0.25, 0.5, 0.75, 1.0 
(bold contour), 2.0, and 5.0. The scale length of the clustering is ~ 6 h~ 1 Mpc, as seen on the x-axis. 
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Fig. 5. — The effects of our target selection algorithm on the two-point correlation amplitude on our GIF mock catalogs. The solid contours 
show £(r p ,7r) for all the galaxies in the catalog, with contours levels of 0.25, 0.5, 0.75, 1.0 (bold), 2.0, 5.0. The data have been smoothed with 
a 3-pixel boxcar in each direction. The dotted contours are for only those galaxies selected to be observed. ^(r p ,n) of the targeted galaxies 
underestimates the true correlation amplitude. Dashed contours show a simple correction for untargetcd galaxies where the redshift of the 
nearest neighbor on the sky within the expected photometric redshift error is used for the untargetcd galaxy. 
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Fig. 6. — Redshift distortions seen in the two-point correlation function arc quantified by the quadrupole-to-monopole ratio £2/^0 °f £( r p, "■)> 
plotted for all the galaxies in the mock catalogs as a solid line with la errors. The dash-dot line is for only those galaxies selected to be 
observed on slitmasks, and the dashed line show the results of a simple correction for untargctcd galaxies. Dotted lines show models of ^2/^0 
for different values of (3. 
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Fig. 9. — The three-point correlation function as a function of the hierarchical term £1^2 + £2§3 + £3^1 in the redshift range 2=0.7—0.9. The 
open squares display the measurements for all the galaxies in the mock catalogs, while the solid squares show measurements for only those 
galaxies targeted by the 2-pass slitmask algorithm. The data were binned in terms of the heirarchical term, and the errorbars correspond to 
the dispersion of the six mock catalogs as well as scatter within the bin from one catalog. The error bars on the solid squares are comparable 
to those displayed but are not shown for clarity. The dotted line corresponds to a simple hierarchical model with Q3 = 1. 
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Fig. 10. — a. The mean evolution of <x g as measured for galaxies in the Hubble Volume mock catalogs, la errors arc plotted in dashed 
lines. The dash-dot line shows the expected linear evolution of erg of the underlying dark matter distribution, normalized to ag of 0.9 at z=0. 
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Fig. 11. — a. The mean evolution of erg as measured for galaxies in the GIF simulation mock catalogs, b. The change of bias with redshift. 
Here the increase of bias at high-z is not solely the result of evolution, as the galaxies at high-z in our sample are more luminous and therefore 
have a higher degree of bias on the whole. 
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Fig. 12. — The B-band luminosity function (LF) of one of our GIF mock catalogs in different redshift bins. The difference in the LF as a 
function of redshift is due mainly to the selection effects of our apparent magnitude-limited sample, though there is some intrinsic evolution 
of the LF present in the simulations. 
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Fig. 13. — Bias evolution for subsamples of the GIF mock catalogs selected by B-band magnitude. Here the change in bias with redshift is 
due to evolution within a specific galaxy type. The brightest galaxies are seen to have a larger bias and stronger evolution with z. 



